Corpus: hun-eu_web_2014_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 96 97 99 99 99
1000 843 976 989 995 998
10000 6910 9614 9893 9937 9957
100000 46627 88536 97430 98786 99281
1000000 46627 88537 97431 98787 99282


Zipf's diagram for sentence endings


Gnuplot diagram

7447 msec needed at 2018-04-27 10:25